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Coronaviruses possess the largest known RNA genome, a 27- to 
32-kb (+)-strand molecule that replicates in the cytoplasm. During 
virus replication, a 3’ coterminal nested set of five to eight subge- 
nomic (sg) mRNAs are made that are also 5’ coterminal with the 
genome, because they carry the genomic leader as the result of 
discontinuous transcription at intergenic donor signals during 
(—)-strand synthesis when templates for sgmRNA synthesis are 
made. An unanswered question is whether the sgmRNAs, which 
appear rapidly and abundantly, undergo posttranscriptional ampli- 
fication. Here, using RT-PCR and sequence analyses of head-to- 
tail-ligated (—) strands, we show that after transfection of an in 
vitro—generated marked sgmRNA into virus-infected cells, the 
sgmRNA, like the genome, can function as a template for (—)-strand 
synthesis. Furthermore, when the transfected sgmRNA contains 
an internally placed RNA-dependent RNA polymerase template- 
switching donor signal, discontinuous transcription occurs at this 
site, and a shorter, 3’ terminally nested leader-containing sgmRNA 
is made, as evidenced by its leader-body junction and by the expres- 
sion of a GFP gene. Thus, in principle, the longer-nested sgmRNAs 
in a natural infection, all of which contain potential internal tem- 
plate-switching donor signals, can function to increase the number 
of the shorter 3’-nested sgmRNAs. One predicted advantage of this 
behavior for coronavirus survivability is an increased chance of 
maintaining genome fitness in the 3’ one-third of the genome via 


a homologous recombination between the (now independently | 


abundant) WT sgmRNA and a defective genome. 


bovine coronavirus | discontinuous transcription | negative-strand RNA 
ligation | negative-strand RNA synthesis | severe acute respiratory syndrome 


y virtue of an RNA-dependent RNA polymerase (RdRp) 
template switch, coronaviruses, which include the severe acute 
respiratory syndrome (SARS) virus, generate a 3’-coterminal 
nested set of subgenomic mRNAs (sgmRNAs) that also contain 
the leader of the genome (1). The leader is 65-90 nt long, 
depending on the species of coronavirus, and the template switch 
likely occurs during synthesis of (—)-strand templates for sgmRNA 
synthesis (2, 3) (Fig. 1, Upper), although some switching during 
(+)-strand synthesis (4) may occur as well (5). With few excep- 
tions, sgmRNAs generated from 3’-proximal template- switching 
donor sites on the genome are progressively more abundant [up to 
70-fold more at the peak time of RNA synthesis, at 6-8 h post- 
infection (6)] than those generated from 5’-proximal sites. The 
sgmRNAs are translated to virion structural proteins or to non- 
structural accessory proteins, both of which may function as viru- 
lence factors or as inhibitors of host immune responses (7, 8). 
The function of the common leader on sgmRNAs is not 
known, but we postulated that it [in its (—)-strand form, called 
the antileader}] serves as a promoter for sgmRNA replication, 
providing a means for sgmRNA amplification (6, 9). The repli- 
cation model was deemed feasible because the antileader is 
longer than the reported <20-nt replication promoters in toga- 
viruses and orthomyxoviruses (10, 11). The sgmRNA replication 
hypothesis led to a search for (—)-strand copies of coronavirus 
semRNAs in porcine transmissible gastroenteritis virus (TGEV) 
and bovine coronavirus (BCoV) (6, 9). The presence of ssgmRNA 
(—) strands and their properties (6, 9) seemed to be consistent 
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with a sgmRNA replication model, because (i) the rates of ap- 
pearance of sgmRNA (+) and (—) strands were inversely related 
to their length (6, 9); (i) sgmRNA-length dsRNA replicative 
intermediates [also called transcriptive intermediates (12)] were 
found that were (presumably reiteratively) active in (+)-strand 
sgmRNA synthesis in TGEV and mouse hepatitis coronavirus 
(MHV) (9, 12-14); and (iii) the 3’-terminal 55 nt and poly(A) 
tail together were found to be sufficient for (—)-strand synthesis 
from a defective interfering (DI) RNA in MHV (15). 

Experiments that directly tested whether a reporter-containing 
sgmRNA underwent replication after transfection into helper virus- 
infected cells did not support a sgmRNA replication model for- 
BCoV (16) or MHV (17), however. For BCoV, no accumulation of 
the reporter-containing 1.7-kb sgmRNA (shown as sgmRNA’-1 in 
Fig. 2) was found when compared with that for a cotransfected 
helper virus-dependent replicon, a 2.2-kb DI RNA containing the 
same reporter (shown as DI RNA-1 in Fig. 2) (16). 

In the present work, using BCoV as a model, we reexamined 
the potential for sgmRNA amplification by a mechanism other 
than replication and postulated that if sgmRNAs function as 
templates for (—)-strand synthesis as does the genome, then 
perhaps internal template-switching donor signals on the longer 
sgmRNAs during (—)-strand synthesis would lead to production 
of shorter internally encoded sgmRNAs (Fig. 1, Lower). Using 
RT-PCR and sequencing of head-to-tail-ligated (—)-strand 
molecules, we demonstrated that a marked synthetic ssmRNA 
can function as a template for (—)-strand RNA synthesis, and 
using RT-PCR sequencing of leader—body junctions and ex- 
pression of a reporter GFP from a sgmRNA, we demonstrated 
that a template-switching signal within the sgmRNA can lead to 
the generation of a shorter internally nested sgmRNA. We found 
no evidence to indicate that the transfected sgmRNAs within the 
24- to 72-h duration of the experiments, in the absence of exper- 
imentally applied selection pressures, had recombined with the 
helper virus genome in such a way as to generate the shorter re- 
porter-containing sgmRNA from the genome. Our findings in- 
dicate that in principle, coronaviral sgmRNAs, once made from the 
genome, can be amplified by acting as templates for a cascading 
transcription process. We speculate about the biomedical impli- 
cations of this behavior, including how it might aid in the survival of 
the large coronavirus genome. 
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Fig. 1. Model for coronavirus subgenomic mRNA amplification. (Upper) 
Model for generating sgmRNA (—) strands (dashed gray line) from the viral 
genome (solid black line) during (—)-strand synthesis. In this model, the 
coronaviral RdRp pauses at intergenic template-switching donor signals 
(vertical black bars) and is transferred by a copy-choice mechanism to 
a highly similar acceptor site near the 5’ end of the genome to copy the 
5’-terminal leader. In this way, a complete (—)-strand template for sgmRN A 
synthesis is made. Synthesis of a full-length (—) strand (the antigenome, not 
depicted) would be used for genome replication. (Lower) The model for 
subgenomic MRNA amplification tested in this study. This model proposes 
that the RdRp sees the sgmRNA (solid black line) as a small genome, initiates 
(—)-strand synthesis on it, and switches template at an internal donor signal 
to make a (—)-strand template (dashed gray line) to synthesize a shorter 
internally nested sgmRNA. The leader is indicated by a filled box. An open 
box indicates the (—)-strand copy of the leader (called an antileader). Note 
that in this figure, sgmRNA 6 also will come from template switching on the 
viral genome (not depicted), as well as from sgmRNA 3 (depicted). 
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Fig. 2. Positive-strand RNA molecules used as templates for testing 
(—)-strand synthesis and sgmRNA production in helper virus-infected cells. At 
the top is a truncated diagram of the BCoV genome indicating its parts that 
are present in a naturally occurring BCoV defective interfering RNA (WT DI 
RNA). RNA molecules to be transfected were generated in vitro with T7 RNA 
polymerase from Mlu [linearized plasmid DNAs. Reporters are T, a 30-nt 
in-frame sequence derived from the transmissible gastroenteritis coronavi- 
rus N gene; MHV 3’UTR, the 301-nt 3’ UTR from mouse hepatitis virus-A59; 
EGFP, the 720-nt GFP gene in the —-1 reading frame with respect to the 
upstream DI RNA and N ORFs; and gD, a 92-nt sequence from the herpes 
simplex virus gD gene. IS denotes the 18-nt intergenic sequence containing 
the heptameric template-switching signal UCUAAAC. The 65-nt BCoV leader 
is identified by a solid black rectangle. The position of the 421 nt differen- 
tiating the DI RNA from sgmRNA 7 is shown. 
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Results 


Transfected sgmRNAs Function as Templates for (—)-Strand Synthesis. 
Northern blot analyses have proven feasible for detecting BCoV 
sg(—)-strand RNAs during natural infection, where the numbers 
of (+)-strand sgmRNAs range from ~20 molecules/cell for the 
viral genome to ~5,000 molecules/cell for sgmRNA 7 [the nu- 
cleocapsid (N) protein-encoding, 3’-terminal, and most abundant 
of eight sgmRNAs] at peak abundance (6-8 h postinfection) and 
the (—)-strand counterparts range from 20- to 50-fold less (6). 
These have not proven feasible for detecting the low numbers of 
sgmRNA (—) strands generated from DI RNA parents trans- 
fected into helper virus-infected cells where sgmRNA production 
for a given template-switching signal is far less robust; for ex- 
ample, for the ssmRNA 7 template-switching signal, it is ~13 


‘molecules per cell from DI RNA, versus ~2,000 from the viral 


genome at 24 h postinfection (18). Therefore, identification of 
sgmRNA (—) strands made from transfected (+)-strand tem- 
plates must rely on a more sensitive method, such as RT-PCR 
analyses, which demonstrate an increase in numbers over time, 
followed by sequence confirmation of the RT-PCR product. 

To determine whether a (+)-strand ssgmRNA transfected into 
BCoV helper virus~infected cells can function as a template for 
sgmRNA (—)-strand synthesis by viral RdRp, we used RT-PCR 
with cDNA-cloned BCoV sgmRNA 7 marked with the MHV 3’ 
UTR (sgmRNA’-2 in Fig. 2) to distinguish it from BCoV helper 
virus sgmRNAs. A 3’-proximal segment within the 301-nt MHV 3’ 
UTR, nt number 46-156 from the 3’ end [which differs by ~60% in 
sequence from the comparable region in the 288-nt BCoV 3’ UTR 
(19, 20), and which is identified as a hypervariable region among 
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Fig. 3. Strategy for detecting viral RdRp-synthesized (—)-strand products 
from marked transfected DI RNA-2 and sgmRNA’-2. (A) The MHV-specific 
PCR primer (primer 2) amplifies sequences from the MHV 3’ UTR, but not the 
BCoV 3’ UTR. (Upper) Diagram of the MHV 3’ UTR. The poly(A)-detecting 
oligo(dT,g) primer is used for reverse-transcription and PCR, and primer 2, 
which binds the MHV (-) strand complementary to nt 99-122 from the poly 
(A) tail in the (+) strand, is used for PCR. (Lower) A ~140-nt RT-PCR band 
(arrowhead) is produced with MHV RNA, but not with BCoV RNA. The 
conditions for the RT-PCR are the same as those described for Fig. 4, except 
that the number of cycles is 25 instead of 34. M, dsDNA molecular size 
markers; MHV, RNA from MHV-infected cells used for RT-PCR; BCoV, RNA 
from BCoV-infected cells used for RT-PCR. (B) Strategy for specifically 
detecting the head-to-tail-ligated RdRp-generated marked (—)-strand RNAs. 
The primers used do not generate an RT-PCR product from input copy-back 
(—)-strand RNA synthesized by the T7 RNA polymerase, but they do generate 
a product from head-to-tail-ligated (—)-strand RNA synthesized by the viral 
RdRp (Fig. 4). The vertical bar depicts the ligation site between the (-)-strand 
5’ monophosphorylated and 3’ hydroxylated ends. The MHV 3’ UTR-specific 
primer 2 shown in A is the same as the RT primer shown in B and also in 
Fig. 4. The (+)-strand-binding primers are depicted above the sequence, and 
the (—)-strand-binding primers are depicted below the sequence. 
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group 2 coronaviruses (19)], was exploited for specific RT-PCR 
differentiation between BCoV and MHV 3’ UTRs (Fig. 3A). 
Despite this difference, BCoV DI RNA with the MHV 3’ UTR 
(DI RNA-2 in Fig. 2) replicated well (Fig. S1), as anticipated, 
because the BCoV 3’ UTR supports replication in the MHV ge- 
nome (21); thus, we used it as a positive control for testing RdRp- 
dependent (—)-strand synthesis from transfected (+)-strand 
sgmRNA’-2 (described below). 

Because T7 RNA polymerase generates transcripts in vitro 
that can be up to a 1% copy-back sequence (22) (Fig. 3B), direct 
RT-PCR sequencing of (—) strands obtained from cells cannot 
differentiate between (—) strands made by T7 RNA polymerase 
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Fig. 4. Transfected sgmRNA’-2 marked with the MHV 3’ UTR functions as 
a template for (—)-strand synthesis. (A) Predicted results with transfected DI 
RNA-2 (control). (B) RT-PCR with primers described in Fig. 38 reveals a ~210- 
nt product of ligated (—)-strands from BCoV-infected cells (Center, arrow- 
head), but not from uninfected cells (Left) or from an in vitro ligation re- 
action with RNA from infected cells mixed with 1 ng of DI RNA-2 (Right). (C) 
Sequence of the cDNA-cloned ligated junction from 12 hpt {lane 11 of 8, 
shown in the (+) strand], in which the MHV 3’ UTR (ending in ...ATCAC3’) 
with the poly(A) tail is shown to be joined to the 5-base-truncated BCoV 
leader S’ end (S’TGAGC...) (vertical bar). (D) Predicted results with trans- 
fected sqgmRNA’-2. (£) RT-PCR reveals a ~210-nt product from ligated 
(—)-strands from BCoV-infected cells (Center, arrowhead), but not from un- 
infected cells (Left) or from an in vitro ligation reaction with RNA from 
infected cells mixed with 1 ng of sgmRNA7-2 (Right). (F) Sequence of the 
cDNA-cloned ligated junction from 12 hpt (lane 11 of £, showing the same 
features as the sequence depicted in C). M, ds DNA size markers in nt pairs. 
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(before transfection) and (—) strands made by viral RdRp (after 
transfection). Thus, we carried out head-to-tail ligation of RNA 
from cells and developed an RT-PCR primer set that uses the 
MHV 3’ UTR (—)-strand-specific primer (Fig. 34) and a BCoV 
leader (+)-strand-specific primer (Fig. 3B) to detect the head- 
to-tail-ligated RdRp-generated sg(—) strands (Fig. 3B). These 
primers do not yield a product from copy-back RNA (Fig. 3B 
and control data described in Fig. 4). These experiments pre- 
sumed that some fraction of the RdRp-generated ssgmRNA(-—) 
strands would be 5’ monophosphorylated and thus ligatable with 
T4 RNA ligase 1 (23). Although both circular [the expected 
predominant form (24)] and tandemly ligated molecules are 
possible after ligation, the results from both would lead to the 
same conclusion because of the specificity of the RT primer for 
the MHV 3’ UTR. A head-to-tail ligation method has been sim- 
ilarly applied to identify terminal features on influenza virus 
RNAs (24). 

In BCoV-infected cells transfected with DI RNA-2 (the pos- 
itive control) (Fig. 44), a ~210-nt RT-PCR product from puta- 
tive head-to-tail-ligated RdRp-generated (—)-strand RNA was 
found (Fig. 4B, lanes 9-14, arrowhead) that was not found in 
transfected mock-infected cells (Fig. 4B, lanes 2-7) or in control 
reactions with in vitro—ligated mixed components containing 
infected cell RNA and input DI RNA-2 (Fig. 4B, lanes 16-21). 
The sequence of the cDNA-cloned ~210-nt ligated product 
obtained at 12 h posttranscription (hpt) in BCoV-infected cells 
(Fig. 4B, lane 11) confirmed a head-to-tail molecule and dem- 
onstrated a 5’ poly(U) tail of ~30 nt (Fig. 4C). Interestingly, the 
3’ terminus of the BCoV leader (—) strand was missing five 3’- 
terminal bases, consistent with an incompletely understood se- 
quence hypervariablity reported previously for the (+)-strand 
BCoV leader (25). In BCoV-infected cells transfected with 
sgmRNA’-2 RNA (Fig. 4D), a ~210-nt product from head-to- 
tail-ligated, putative RdRp-generated (—)-strand RNA was 
found (Fig. 4£, lanes 9-14, arrowhead) that was not found in 
transfected mock-infected cells (Fig. 4£, lanes 2-7) or in control 
reactions with in vitro—ligated mixed components containing 
infected cell RNA and input sgmRNA-2 (Fig. 4£, lanes 16-21). 
RT-PCR sequencing of the cDNA-cloned ~210-nt product 
obtained at 12 hpt (Fig. 4£, lane 11) confirmed a ligated head-to- 
tail (—) strand and also demonstrated a 5’-poly(U) tail of ~30 nt 
and a 3’-terminal 5-nt deletion (Fig. 4F). Strikingly, in contrast to 
the pattern seen for DI RNA, the sgmRNA (-—) strands were 
expressed for a shorter period (6-18 hpt, with peak abundance at 
12 hpt), suggesting that replication of the ssmRNA/’-2 molecule 
might not occur or that if it does occur, it is short-lived. 

These findings also suggest that the initiation of (—) strand 
synthesis is likely derived from a cleaved primer within the 5’ 
poly(U) tail, because the (—)-strand RNAs were ligatable with 
T4 RNA ligase 1 and thus necessarily 5’-monophosphorylated 
(23) (Discussion). 

Because a high rate of homologous RNA recombination 
between coronavirus molecules is well documented (25% over 
the entire genome) (26-28), and, along with experimentally 
applied selection methods, served as the basis for the first re- 
verse-genetics system developed in coronaviruses (1, 29, 30), we 
conducted concurrent analyses of the helper virus genome to 
search for evidence of a recombinant genome that might have 
given rise to the observed marked ligated (—)-strand molecules. 
For this, the 6- to 24-h samples from DI RNA-2 and sgmRNA’-2 
that demonstrated head-to-tail-ligated (—) strands (Fig. 4 B and 
E) were tested by RT-PCR for a 1,639-nt 3’-proximal helper virus 
genome-integrated reporter-containing sequence between the 
membrane (M) protein gene and the 3’ UTR. This was done 
under PCR conditions (with a 90-s extension time vs. 30 s as used 
in Fig. 4 B and £) that can readily identify a 1,637-nt fragment 
from a helper virus genome (Fig. S2B, Top and Bottom, lane 15). 
None was found (Fig. S24, lanes 2-9), leading us to conclude that 
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Fig. 5. A short sgmRNA is produced from transfected sgmRNA’-3, which 
carries a transcription signal and EGFP gene. (A) Predicted results with DI 
RNA-3 (control). (B) RT-PCR reveals a 150-nt leader—body product from 
transfected BCoV-infected cells (lanes 17-24, arrowhead), but not from un- 
infected cells (lanes 2-11) or from infected cells alone (RT-PCR control a), 
from a mixture of RNA from infected cells at 13 h postinfection with 1 ng of 
DI RNA-3 (RT-PCR control b), or from an RT-PCR mix with primers alone (RT- 
PCR control c}). The 150-nt RT-PCR product is not observed in cells infected 
with progeny helper virus passages 1 and 2 (lanes 25 and 26) or in blind 
passages from uninfected cells (lanes 12 and 13). A 1,757-nt RT-PCR product 
from input DI RNA-3 is identified with an asterisk. (C) Sequence of the cDNA- 
cloned 150-nt RT-PCR product from lane 21 in B showing the leader—body 
junction (vertical bar), the heptameric template-switching signal (under- 
lined), and the AUG translation start codon (overlined) for EGFP. (D) Pre- 
dicted results with sgmRNA’-3. (E) RT-PCR reveals a 150-nt leader-body 
product from transfected BCoV-infected cells (lanes 17-24, arrowhead), but 
not from uninfected cells (lanes 2-11}, from infected cells alone (RT-PCR 
control a), from a mixture of RNA from infected cells at 13 h postinfection 
with 1 ng of sgmRNA’-3 (RT-PCR control b), or from an RT-PCR mix with 
primers alone (RT-PCR control c). The 1S0-nt RT-PCR product likewise is not 
observed in cells infected with progeny helper virus passages 1 and 2 (lanes 
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the sgmRNA/’-2 (—) strands detected in the experiment did not 
come from a recombinant helper virus genome. 


An intergenic Transcription-Associated (Template-Switching) Signal on 
the Transfected sgmRNA (+) Strand Leads to Generation of a Smaller 
Internally Nested Reporter-Containing sgmRNA. To test the hypoth- 
esis that sgmRNAs containing an internal template-switching 
signal can generate a shorter ssmRNA, we compared sgmRNA’- 
3 with DI RNA-3 (the positive control) (Figs. 2 and 5A). For 
sister constructs of DI RNA-3, sgmRNA generation from internal 
donor template-switching signals is well documented (18, 31, 32). 
From BCoV-infected cells transfected with DI RNA-3, a 150-nt 
nested RT-PCR product was obtained that first appeared at 1 hpt 
and was present throughout the 72-h experiment, but was most 
abundant at 6-48 hpt (Fig. 5B, lanes 17-24, arrowhead). A similar 
product was not detectable in mock-infected cells (Fig. 58, lanes 
2-11) or in in vitro mixtures of reaction components (Fig. 5B, 
lanes 27-29), indicating that the 150-nt product arose from an in 
vivo RdRp-generated internally nested short ssmRNA. The se- 
quence of the cDNA-cloned 150-nt molecule obtained at 12 hpt 
(Fig. 5B, lane 21) confirmed the expected sgmRNA leader—body 
junction (Fig. 5C). A 1,757-nt nested RT-PCR product from pri- 
mers binding within the transfected parent DI RNA was obtained 
with RNA from both infected and uninfected cells (Fig. 5B, as- 
terisk). From BCoV-infected cells transfected with sgmRNA/’-3 
RNA (Figs. 2 and 5D), a 150-nt RT-PCR product from the leader— 
body junction of the putative short reporter-containing sgmRNA 
was found that first appeared at 1 h and was present throughout 
the 72-h experiment, but was most abundant at 12 hpt (Fig. 5E, 
lanes 17-24, arrowhead). The 150-nt product was not found in 
RNA from transfected uninfected cells (Fig. 5£, lanes 2-11) or in in 
vitro mixtures of reaction components (Fig. SE, lanes 27-29). A 
1,336-nt nested RT-PCR product from primers binding within the 
transfected parent sgmRNA was obtained with RNA from both 
infected and uninfected cells (Fig. SE, asterisk). The sequence of 
the cDNA-cloned 150-nt product at 12 hpt (Fig. SE, lane 21) con- 
firmed its origin at the template-switching signal in ssmRNA/’-3 
(Fig. 5F). Interestingly, the relatively short-lived peak abundance of 
the 150-nt leader-body product from sgmRNA’-3 (maximal at 
12 hpt) compared with DI RNA-3 (maximal at 6-48 hpt) resembled 
that for the (—)-strand RNA synthesized from sgmRNA’-2 (Fig. 4E, 
lanes 10-12), suggesting that the transfected ssmRNA’-3 molecule 
might not be replicated, or if it is replicated, it is short-lived. 

In a second experimental approach, fluorescing cells were 
sought as evidence of EGFP expression from the internally 
encoded small sgmRNA. Because the EGFP ORF in both the DI 
RNA-3 and sgmRNA’-3 templates is in the —1 reading frame with 
respect to the upstream ORFs, fluorescence of EGFP as a fusion 
protein was not expected, and was not observed, in transfected 
uninfected cells (Fig. 5G, panels 1 and 3). It was found in ~0.125% 
of cells for DI RNA-3 and in ~0.00015% of cells for sgmRNA’-3 in 
the presence of helper virus (Fig. 5G, panels 2 and 4). The fluo- 
rescence in ~1,000-fold fewer cells for sgmRNA’-3 likely reflects 
the transient transcription observed from this construct. 

Two experimental approaches indicated that the small reporter- 
containing sgmRNA did not come from a recombinant helper vi- 


25 and 26) or in blind passages from uninfected cells (lanes 12 and 13). A 
1,336-nt RT-PCR product from input sgmRNA’-3 is identified with an asterisk. 
(F) Sequence of the cDNA-cloned 150-nt RT-PCR product from lane 21 in 
E showing the same features as described in C. (G) Fluorescence evidence of 
EGFP expression from the sgmRNA’-3-encoded short sgmRNA. Panels 1 and 
3 are mock-infected cells transfected with DI RNA-3 and sqgmRNA’-3, re- 
spectively; panels 2 and 4 are BCoV-infected cells transfected with DI RNA-3 
and sgmRNA’-3, respectively. In all cases, cells were examined for fluores- 
cence at 16 h posttransfection. 
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rus genome. In the first approach, RT-PCR tests on RNA 
extracted at 12 hpt from cells infected with progeny helper virus 
from transfected cells (identified as virus passages 1 and 2) dem- 
onstrated no evidence of the small reporter-containing ssgmRNA 
(Fig. 5 B and E, lanes 25 and 26). In the second approach, RT-PCR 
tests on RNA from the transfected cells described in Figs. 5 B 
and FE, lanes 15-24, in which an upstream M gene-specific 
primer replaced the leader-specific primer 2 in three separate PCR 
primer combinations, demonstrated no evidence of a recombinant 
helper virus genome (Fig. S2B). Together, these results indicate 
that a longer sgmRNA with an internal RdRp template-switching 
donor signal can serve as a template for generating shorter in- 
ternally encoded sgmRNAs. | 


Discussion 


We have demonstrated two previously undescribed features of 
coronavirus sgmRNA synthesis: (i) sgmRNA (+) strands can 
function as templates for sgmRNA (—)-strand RNA synthesis, 
and (ii) template-switching signals on sgmRNA (+) strands can 
trigger synthesis of internally nested (+)-strand sgmRNAs. The 
first finding is consistent with two previous reports describing 
putative requirements for initiation of (—)-strand synthesis in 
coronaviruses. Lin et al. (15) demonstrated by RNase protection 
assays that the 3’-terminal 55 nt along with the poly(A) tail in 
coronavirus DI RNA (and by implication, sgmRNA) are sufficient 
for (—)-strand synthesis in vivo. Ziist et al. (33) demonstrated, 
through genetic and phylogenetic evidence, that an intramolecular 
pairing between nine bases at the genome 3’ terminus and bases 
within loop 1 of the 3’ UTR pseudoknot mapping 220 nt upstream, 
even without the nt 30-170 sequence from the 3’ end, is required 
for MHV genome replication. The model of Ziist et al. (33) pro- 
poses that the nine-base interaction is necessary for assembly of 
the replication complex and initiation of (—)-strand synthesis. 
These authors further suggested that this required base pairing 
might have occurred in trans in the earlier study of Lin et al. (15). 
Because the polyadenylated 3’ UTR on sgmRNAs is identical to 
that on the genome, the 3’-proximal structural requirements 
for (—)-strand synthesis as defined previously (15, 33) would be 
expected to be met. The present study extends the observations 
on initiation of (—)-strand synthesis by showing that poly(U) is 
part of the nascent (—) strand. The identity of the primer used by 
the primer-dependent viral RdRp (nsp 12) remains unresolved 
by our experiments (34, 35). We presume that the 5’ monophos- 
phorylated U enabling head-to-tail ligation by T4 RNA ligase 1 
shown in Fig. 4 is the product of an as-yet-unidentified endonu- 
clease, because initiation of coronavirus (—)-strand synthesis likely 
uses a 5’ triphosphorylated single nucleotide primer [as would be 
the case for de novo initiation (36)] or an oligonucleotide primer 
[as would be made by the viral primase (nsp 8) (37)]. The coro- 
naviral endonuclease (EndoU) (nsp 15) is an unlikely candidate 
for this cleavage, because it leaves a 2’-3’ cyclic phosphate (38, 39) 
that is not ligatable with T4 RNA ligase 1 (23). The importance of 
the cleavage in the nascent (—) strand as observed here, if any, 
remains to be determined. 

We did not extend our analyses to explore how the genome- 
specific 5’-proximal (+)-strand 421 nt differentiating DI RNA 
from sgmRNA 7 (Fig. 2) (16) influences the initiation of (—)-strand 
synthesis. However, it clearly is not required in cis for initiation 
of sgmRNA (—)-strand synthesis, as has been postulated (3), al- 
though it might function to enhance initiation. Certainly the 
5‘-proximal 421-nt sequence contributes to long-term accumula- 
tion of DI RNA compared with sgmRNA’ (Figs. 4 and 5) (16), 
suggesting that some feature within it might be required for the 
initiation of new (+)-strand synthesis. The (—)-strand complement 
of the 421 nt, for example, might function to anchor the (—)-strand 
template within the membranous replication/transcription com- 
plex for reiterated (+)-strand synthesis, as has been suggested for 
the (—) strand of tomato bushy stunt virus (40, 41). In the absence 
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of an anchor, sgmRNA (—) strands might be ushered from the 
replication/transcription compartment and thus provide only a 
short-term template function. To the best of our knowledge, the 
only report of posttranscriptional sgmRNA replication is that 
for flock house virus (42), whose features might prove instructive 
for further analysis of coronavirus ssgmRNA. 

Our second finding indicates that the larger sgmRNAs tran- 
scribed from the genome via the template-switching process could 
themselves serve as donor templates for discontinuous transcrip- 
tion and the generation of internally nested ssmRNAs, as depicted 
in Fig. 1, Lower. This mechanism likely would contribute to the 
progressively greater abundance of the 3’-proximal sgmRNAs. It 
also suggests that RdRp template-switching rates at signaling sites 
on the sgmRNA would be influenced by the flanking and other 
more distant enhancer elements known to affect template switch- 
ing on the genome (31, 43-47). 

We envision the biomedical implications of sgmRNA ampli- 
fication in coronaviruses to include the following: 


(i) A cascading mechanism for ssgmRNA production would 
relieve pressure on the large genome as the sole template 
for sgmRNA synthesis and thus might facilitate genome- 
length (—)-strand synthesis. 

(ii) A rapid appearance of large numbers of sgmRNAs 
encoding virulence factors, whether the resulting proteins 
act directly to cause lesions (8) or indirectly to inhibit 
host immune responses (7), could significantly enhance 
early disease development. 

(iii) Although copy-choice recombination between the trans- 
fected marked sgmRNAs and helper virus genome was 
not observed in the experiments of short duration per- 
formed here, much experimental evidence documenting 
coronavirus recombination has unequivocally established 
that it would occur over time and under selective pressures 
(27-30, 48). Furthermore, recombination between corona- 
virus genome and sgmRNAs occurring during natural 
infections has been offered as an explanation for the skew- 
ing of higher recombination rates toward the 3’ end of the 
MHV genome (28, 48). Thus, we postulate that a pool of 
sgmRNAs having been amplified posttranscriptionally 
from a_ fit genome could recombine with a posttranscrip- 
tionally (mutated) debilitated genome to restore fitness. 
The demonstration that coreplicating subgenome-length 
defective picornavirus RNA molecules can give rise to 
a full-length fit standard virus genome through recombina- 
tion (49) lends support to this idea. Thus, a sgmRNA-assis- 
ted, recombination-based fitness restoration mechanism as 
postulated here could, along with a proofreading function 
for the coronavirus-encoded 3’—5’ exonuclease (NendoU; 
nsp 14) (50), contribute significantly to the survival of the 
largest known viral RNA genome. 


Materials and Methods 


Plasmid Construction, in Vitro T7 RNA Polymerase Synthesis of Marked DI RNAs 
and sgmRNAs, Transfection, and Virus Propagation. Construction of pDI RNA-1 
and psgmRNA’-1 (formerly called pDrep1 and pNrep2, respectively) (Fig. 2) 
have been described previously (16). pDI RNA-2 and psgmRNA’-2 (Fig. 2) 
were made by replacing the 288-nt 3’ UTR of BCoV-Mebus (GenBank ac- 
cession no. U00735) in pDI RNA-1 and psgmRNA/’-1 with the 301-nt 3’ UTR of 
MHV-A59 (GenBank accession no. NC_001846) from fragment G DNA (51). 
pDI RNA-3 and psgmRNA’-3 (Fig. 2) are identical to pDI RNA-1 and 
psgmRNA’-1, respectively, except that inserted at the Nsil site is an 836-nt 
sequence containing, in order, an 18-nt intergenic sequence harboring the 
underlined heptameric template-switching signal (5’AATATCTAAACTT- 
TAAGG3’), the 720-nt EGFP gene including a stop codon [from p!IRES1-EGFP 
DNA (Clontech)], a 6-nt Kpnl recognition sequence, and a 92-nt HSV 1-gD 
epitope-encoding sequence (31). In both pDI RNA-3 and psgmRNA/’-3, the 
EGFP gene is in the —1 reading frame with respect to the upstream DI RNA 
and N gene coding regions. For each, uncapped T7 RNA polymerase tran- 
scripts were made from plasmid linearized with Mlul (Fig. 2), treated with 
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RNase-free DNase (Promega), and chromatographed through a Biospin 6 
column (Bio-Rad) before use in transfection (18). 

BCoV was propagated on human rectal tumor (HRT)-18 cells, and MHV was 
propagated on delayed brain tumor (DBT) cells (18, 20). For transfection, HRT 
in 35-mm dishes at ~80% confluency (~8 x 10° cells/dish) was infected with 
BCoV at a multiplicity of infection of 5 PFU per cell and transfected 1 h later 
with 600 ng of transcript RNA using Lipofectin (Invitrogen) (16). 


Head-to-Tail Ligation of Viral (—)-Strand RNA Products and Sequence Analysis 
of the Junction. RNA was extracted with TRIzol (Invitrogen) (18), and one fourth of 
the volume from one plate (~10 1g total per plate) in 25 pL of water was heat- 
denatured at 95 °C for 5 min and then quick-cooled. Then 3 pL of 10x ligase buffer 
and 2 U (in 2 pL) of T4 RNA ligase 1 (New England Biolabs) were added, and the 
mix was incubated for 16 h at 16 °C. Phenol-chloroform-extracted ligated RNA in 
20 pL of reaction buffer was used for the RT reaction with SuperScript II reverse- 
transcriptase (Invitrogen), primer MHV3’UTR99-122(—) (Table $1), and RT primer 
in Fig. 4, which binds 99-122 nt from the polyU tail in the (—)-strand of the MHV- 
A59 3’ UTR. Of this, 5 pL was used in a 50-yL PCR with AccuPrime Taq DNA 
polymerase (Invitrogen), primer BCVL29-54(+) (Table $1), primer 2 in Fig. 4, which 
binds nt 29-54 in the (+)-stand leader of BCoV, and primer MHV3’UTR99-122(—). 
The resulting mixture was heated to 94 °C for 2 min, then subjected to 34 cycles 
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of 30s at 94°C, 30s at 55 °C, and 30s at 72 °C. The ~210-bp PCR product resolved 
by a nondenaturing agarose (2.5%) gel electrophoresis was sequenced either 
directly or after cloning into TOPO XL PCR (Invitrogen). 


Leader-Junction Sequence Analyses of (+)-Strand sgmRNAs Derived from 
Transfected DI RNA and sgmRNA. RNA was extracted with TRizol, and RT- 
PCR reactions were carried out with oligonucleotide 5’GD(+) (Table $1) [the 
RT primer in Fig. 5 which binds within the (+)-strand of the partial HSV 1 gD 
gene (31)] for RT, oligonucleotide leader20(-) (Table $1) [primer 2 in Fig. 5 
which binds the 3’-terminal 20 nt within the (-)-strand of the BCoV leader], 
and oligonucleotide EGFP2(+) (Table $1) [primer 3 in Fig. 5 which binds the 
(+)-strand of the EGFP gene] for PCR. RT-PCR was carried out as described for 
head-to-tail ligation assays, except that 5 pL of the RT reaction mixture was 
used in a 50-pL PCR mix that was heated to 94 °C for 2 min and then sub- 
jected to 29 cycles of 30s at 94 °C, 30s at 55 °C, and 90 s at 72 °C. The 
agarose gel-purified ~150-bp PCR products were sequenced either directly 
or after cloning into TOPO XL PCR (Invitrogen). 
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